Prosodic control in Chinese TTS system
Abstract
In this paper, the prosodic control strategy is discussed in the context of the overall design of a Chinese TTS system. A four-level (syllable, prosodic word, prosodic phrase, and sentence) pitch modification model and a multiplicative duration model are proposed. Although the prototype models were formed in 1994, results of subsequent related research based on large speech databases are also presented; these results have helped refine the prosody control model of the Chinese TTS system.
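As a rough illustration of what a multiplicative duration model generally looks like, the sketch below scales an intrinsic syllable duration by a product of context-dependent factors. It is not the paper's model: the function name `syllable_duration`, the factor names, and all numeric values are hypothetical and chosen only for illustration.

```python
from math import prod


def syllable_duration(base_ms, factors):
    """Scale an intrinsic syllable duration (ms) by the product of context factors.

    `factors` maps a hypothetical factor name to a multiplier; 1.0 means
    "no change". An empty mapping leaves the base duration untouched.
    """
    return base_ms * prod(factors.values())


# Hypothetical factors for one syllable; none of these values come from the paper.
example_factors = {
    "tone": 1.0,                       # assumed tone-dependent scaling
    "position_in_prosodic_word": 0.9,  # assumed shortening inside a prosodic word
    "phrase_final": 1.25,              # assumed lengthening before a phrase boundary
}

duration_ms = syllable_duration(200.0, example_factors)
print(f"predicted duration: {duration_ms:.1f} ms")
```

Because the factors multiply, each one rescales the duration independently of the others, so a factor can be estimated or tuned on its own without changing how the rest are combined.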
Similar resources
Training prosodic phrasing rules for Chinese TTS systems
This paper describes several experiments designed to train prosodic phrasing models for Chinese TTS systems and to investigate the underlying rules that control Chinese prosody. First, we collected 559 sentences from news programs and built a large corpus for modeling Chinese prosody. Second, we selected 20 features and used classification and regression trees (CART) and transformational rule-b...
Design and evaluation of a speech synthesis model based on concatenation of prosodic-context-sensitive units
This paper describes the design and evaluation of prosodically-sensitive concatenative units for a Persian text-to-speech (TTS) synthesis system. The syllables used are prosodically conditioned in the sense that a single conventional syllable is stored as different versions taken directly from the different prosodic domains of the prosodically labeled, read sentences. The three levels of the Per...
Annotation of Chinese Prosodic Level Based on Probabilistic Model
In this paper, a probability based method is proposed for the annotation of Chinese prosodic levels. We investigated the acoustic correlates (F0 pitches, durations and pauses) of the prosodic boundaries and tried to annotate the levels of the boundaries using Gaussian model and Bayesian decision. The method allows efficient and automatic labeling for the large scale speech corpus and can be use...
A probabilistic approach to prosodic word prediction for Mandarin Chinese TTS
The prosodic word is a basic rhythmic unit of Mandarin Chinese speech. It is one of the most important factors determining the naturalness of speech generated by a TTS system. This paper investigates the problem of predicting Chinese prosodic words from a word sequence. First, we examine the patterns of Chinese prosodic words and investigate the key features for prediction. Then a baseline model ...
An NN-based Approach to Prosodic for Synthesizing English Words Em
In this paper, a neural network-based approach to generating proper prosodic information for spelling/reading English words embedded in background Chinese texts is discussed. It expands an existing RNN-based prosodic information generator for Mandarin TTS to an RNN-MLP scheme for Mandarin-English mixed-lingual TTS. It first treats each English word as a Chinese word and uses the RNN, trained fo...
Publication date: 2000